Autocorrelation and double autocorrelation based spectral representations for a noisy word recognition system

نویسندگان

  • Tetsuya Shimamura
  • Ngoc Dinh Nguyen
چکیده

Two methods of spectral analysis for noisy speech recognition are proposed and tested in a speaker independent word recognition experiment under an additive white Gaussian noise environment. One is Mel-frequency cepstral coefficients (MFCC) spectral analysis on the autocorrelation sequence of the speech signal and the other is MFCC spectral analysis on its double autocorrelation sequence. The word recognition experiment shows that both of the proposed methods achieve better results than the conventional MFCC spectral analysis on the input speech signal.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

Novel Feature Vector Set Extraction using Spectral Peaks in Autocorrelation Domain

This paper presents a new feature vector set for noisy speech recognition in autocorrelation domain. The autocorrelation domain is well known for its pole preserving and noise separation properties. In this paper we will use the autocorrelation domain as an appropriate candidate for robust feature extraction. In our approach, extraction of mel frequency cepstral coefficients (MFCC) of the speec...

متن کامل

Robust feature extraction based on spectral peaks of group delay and autocorrelation function and phase domain analysis

This paper presents a new robust feature set for noisy speech recognition in phase domain along with spectral peaks obtained from group delay and autocorrelation functions. The group delay domain is appropriate for formant tracking and autocorrelation domain is well-known for its pole preserving and noise separation properties. In this paper, we report on appending spectral peaks obtained in ei...

متن کامل

Entropy based combination of tandem representations for noise robust ASR

In this paper, we present an entropy based method to combine tandem representations of the recently proposed Phase AutoCorrelation (PAC) based features and MelFrequency Cepstral Coefficients (MFCC) features. PAC based features, derived from a nonlinear transformation of autocorrelation coefficients and shown to be noise robust, improve their robustness to additive noise in their tandem represen...

متن کامل

On the Use of Asymmetric Windows for Robust Speech Recognition

This paper deals with the problem of searching for a suitable window for robust speech recognition in noisy conditions. A set of asymmetric windows, socalled DDRc,w, are proposed which are controlled by two parameters, center c and width w. These windows are derived from the DDR window used in the higher-lag autocorrelation spectrum estimation (HASE) method and act over the OSA (OneSided Autoco...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010